ON RATIONAL APPROXIMATION 
OF ALGEBRAIC FUNCTIONS 

JULIUS BORCEA*, RIKARD B0GVAD, AND BORIS SHAPIRO 

Abstract. We construct a new scheme of approximation of any multivalued 
algebraic function /(z) by a sequence {rn{z)}neN of rational functions. The 
latter sequence is generated by a recurrence relation which is completely de- 
termined by the algebraic equation satisfied by f{z). Compared to the usual 
Fade approximation our scheme has a number of advantages, such as simple 
computational procedures that allow us to prove natural analogs of the Fade 
Conjecture and Nuttall's Conjecture for the sequence {rn{z)}n£N in thi^ com- 
plement CP^ \ ^/ ) where X>^ is the union of a finite number of segments of real 
algebraic curves and finitely many isolated points. In particular, our construc- 
tion makes it possible to control the behavior of spurious poles and to describe 
the asymptotic ratio distribution of the family {rn{z)}„£Ti. As an application 
we settle the so-called 3-conjecture of Egecioglu et al dealing with a 4-term 
recursion related to a polynomial Riemann Hypothesis. 



1. Introduction and main results 

Rational approximants of analytic functions and the asymptotic distribution of 
their zeros and poles are of central interest in many areas of mathematics and 
physics. For the class of algebraic functions these questions have attracted special 
attention due to their important applications ranging from the convergence theory 
of Fade approximants |8()j-j!'{5| and the theory of general orthogonal polynomials 
[161 1261 Kffil IHHj to statistical mechanics [281 1^ , complex Sturm-Liouville problems 
[S], inverse scattering theory and quantum field theory [2]. 

The main purpose of this paper is to give a simple and direct construction of 
families of rational functions converging to a certain branch of an arbitrary (multi- 
valued) algebraic function f{z). While the usual Fade approximation requires the 
knowledge of the Taylor expansion of f{z) at oo, our scheme is based only on the 
algebraic equation satisfied by f{z) and has therefore an essentially different range 
of applications. Indeed, let 

k 

P(y,z) = ^Pfe_,(z)/-' (1.1) 

1=0 

denote the irreducible polynomial in (?/, z) defining f{z), that is, P{f{z),z) = 0. 
Note that P{y, z) is uniquely defined up to a scalar factor. Let us rewrite (|1.1|) as 

and consider the associated recursion of length fc + 1 with rational coefficients 
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Choosing any initial fc-tuple of rational functions IN ~ {qo{z), . . . ,qk-i{z)} one 
can generate a family {qn{z)}neN of rational functions satisfying H1.3(l for all n > fc 
and coinciding with the entries of IN for < n < ^ — 1- The main object of study 
of this paper is the family {r„(z)}„gN, where r„(z) = 

In order to formulate our results we need several additional definitions. Consider 
first a usual recurrence relation of length fc + 1 with constant coefficients 

-Un = aiUn-1 + a2Un-2 + ■ • • + a^Un-k, (1-4) 

where ak ^ 0. 

Definition 1. The asymptotic symbol equation of recurrence H1.4|l is given by 

t'' + ait^'^ + a2t^^^ + . . . + afc = 0. (1.5) 

The left-hand side of the above equation is called the characteristic polynomial of 
recurrence (11.411 . Denote the roots of 11.5|l by ti , . . . , and call them the spectral 
numbers of the recurrence. 

Definition 2. Recursion 1)1. 4|l and its characteristic polynomial are said to be 
of dominant type or dominant for short if there exists a unique (simple) spectral 
number T„iax of this recurrence relation satisfying |T„iaj;| ~ maxi<j<i: \Ti\. Other- 
wise (|1.4ll and H1.5|) are said to be of nondominant type or just nondominant. The 
number Tmax will be referred to as the dominant spectral number of H1.4|) or the 
dominant root of l|1.5|l . 

The following theorem may be found in TP, Ch. 4]. 

Theorem 1. Let fc e N and consider a k-tuple (ai, . . . ,afc) of complex numbers 
with Ok ^ 0. For any function u : Z>o — *■ C the following conditions are equivalent: 

(i) En>o = ™^ere Q2(i) = 1 + ait -I- a2t^ + ... + aut'^ and Qi{t) 
is a polynomial in t whose degree is smaller than fc. 

(ii) For all n > k the numbers Un satisfy the (fc + \)-term recurrence relation 
given by H1.4|l . 

(iii) For all n > one has 

r 

u^=J2p,{n)rl\ (1.6) 

i=l 

where ti , . . . , are the distinct spectral numbers of (|1.4|l with multiplicities 
mi, . . . ,mr, respectively, and pi{n) is a polynomial in the variable n of 
degree at most mi — 1 for 1 < i < r. 

Note that by Definition |21 the dominant spectral number Tmax of any dominant 
recurrence relation has multiplicity one. 

Definition 3. An initial fc-tuple of complex numbers {mo,ui, . . . ,Uk-i} is called 
fast growing with respect to a given dominant recurrence of the form H1.4() if the 
coefficient Kmax of T^ax ™ l|l-6|l is nonvanishing, that is, m„ = KmaxT^ax + ■ ■ • with 
Kmax 7^ 0. Otherwise the fc-tuple {mq, ui, . . . , Uk-i} is said to be slow growing. 

Let Vk = {o-ky^ + flfe-iy'^^^ + . . . + ao | fli G C, < « < fc} denote the linear 
space of all polynomials of degree at most fc with complex coefficients. 

Definition 4. The real hypersurface 'E.k C Vk obtained as the closure of the set of 
all nondominant polynomials is called the standard equimodular discriminant. For 
any family 

r(?/, zi, . . . ,Zq) = {ak{zi, . . .,Zq)y^ + afe_i(zi, . . .,Zq)y^^^ + . . . + ao{zi, . . . ,z,)} 
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of irreducible polynomials of degree at most k in the variable y we define the induced 
equimodular discriminant Sr to be the set of all parameter values (zi, . . . , Zq) S Vfl 
for which the corresponding polynomial in y is nondominant. Given an algebraic 
function f[z) defined by we denote by the induced equimodular discrimi- 
nant of (|1.2|l considered as a family of polynomials in the variable y. 

Example 1. For k — 2 the equimodular discriminant S2 C V2 is the real hypersur- 
face consisting of all solutions to ea^ — 4aoa2 = 0, where e is a real parameter with 
values in [1, 00). More information on can be found in IIUI and |12j . 

Definition 5. Given an algebraic function f{z) defined by and an initial 

fc-tuple of rational functions IN — {qo{z), . . . , qk-i{z)} we define the pole locus 
TT/./AT associated with the data (/, IN) to be the union between the zero set of the 
polynomial Pk{z) and the sets of all poles of qi{z) for < i < fc — 1. 

As we explain in §3, one of the fascinating features of Fade approximants is 
the complexity of their convergence theory. The main challenges of this theory are 
the Fade (Baker-Gammel- Wills) Conjecture and Nuttall's Conjecture (cf. [2115^). 
The general version of the former was recently disproved by D. Lubinsky |24| . 
Subsequently, V. Buslaev constructed counterexamples to this conjecture for 
some special algebraic (hyperelliptic) functions. Nevertheless, a question of central 
interest in many applications is whether the Fade Conjecture could hold for certain 
classes of algebraic functions. The first main result of this paper shows that a 
natural analog of the Fade Conjecture is always true for our approximation scheme: 

Theorem 2. Let f{z) be an algebraic function defined by Hl.l|l . For any initial 
k-tuple of rational functions IN = {go(z), . . . , (7fe_i(z)} there exists a finite set 
S/,/jv C CP \ {Ef U T fjpf) such that 

r-niz) = =t ydomiz) in CP^ \ T>f as n -* 00, 

qn-i{z) 

where ydom{z) is the dominant root of equation H1.2|l . T>f = SyUT/jjyU Y^fjN, 
and =^ stands for uniform convergence on compact subsets 0/ CP""^ \ ^Z- 

The set '^f.iN consists precisely of those points z G such that the initial k- 
tuple IN — {(7o(-z), ■ • ■ , qk-i{zy\ is slowly growing with respect to recurrence (|1.3|l 
evaluated at z (cf. Definition O . This motivates the following definition. 

Definition 6. The set T,fjN is called the set of slow growth associated with the 
data(/,/7V). 

Although S/jjv is the set of solutions of an implicit equation (cf. H2.13(l ^ it turns 
out that the set of slow growth is actually a subset of a certain discriminantal set 
S which is given explicitly in terms of the data (/, IN) (see the proof of Lemma|Hll. 

Remark 1 . In the final stage of preparation of this paper the authors noticed that 
some weaker results similar to Theorem |2 appeared in [7||S]. The latter deal with 
pointwise limits of certain families of recursively defined polynomials and are often 
used in modern statistical physics, see e.g. j2Sj and references therein. 

The next result describes the rate of convergence of the sequence {r„(z)}„gN to 
ydomiz), which proves in particular the uniform convergence stated in Theorem |2| 
Given < e < 1 set 9^ = CP^ \ O^, where is the e-neighborhood of Vf = 
'E.f U T fjN U S/,7iv in the spherical metric on CP"'^. 

Theorem 3. For any sufficiently small e > the rate of convergence of rn{z) ^ 
ydom{z) in 0e is exponential, that is, there exist constants 971 > and q G (0, 1) 
such that \rn{z) — ydom{z)\ < 9Jlg" for all z £ O^. 
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Definition 7. Given a meromorphic function g in some open set £7 C C we con- 
struct its (complex-valued) residue distribution Vg as follows. Let {z,„ | m € N} be 
the (finite or infinite) set of all the poles of g in Q. Assume that the Laurent ex- 
pansion of g at Zm has the form g{z) ~ X)-oo</<; (z-'z ' y • Then the distribution 
Ug is given by 

m>i yi<;</,„ ^ j 

where (5^^ is the Dirac mass at z,m and the sum in the right-hand side of 1)1. 7|l is 
meaningful as a distribution in since it is locally finite in O. 

Remark 2. The distribution Vg is a complex- valued measure if and only if g has all 
simple poles, see [51 p. 250]. If the latter holds then in the notation of Definition [7| 
the value of this complex measure at Zm equals Ajn,i, i.e., the residue of g at Zm- 

Definition 8. Given an integrable complex-valued distribution p in C we define 
its Cauchy transform Cp{z) as 

It is not difficult to see that the distribution p itself may be restored from its 
Cauchy transform Cp{z) by 

TT OZ 

where Cp{z) and its ^-derivative are understood as distributions. As is well-known, 
any meromorphic function g defined in the whole complex plane C is the Cauchy 
transform of its residue distribution Ug if the condition J^dvg{^) < oo holds, see 
e.g. H p. 261]. 

Definition 9. Given a family {(/)n(z)}neN of smooth functions defined in some 
open set 17 C C one calls the limit ^(z) = lim„^oo "^^^(i)^^ asymptotic ratio of 
the family, provided that this limit exists in some open subset of fl. If {(/)n(2:)}ngN 
consists of analytic functions and denotes the residue distribution of the mero- 
morphic function '^'^'^^^^^ in 17 then the limit v — lim„^oo t^n (if it exists in the 
sense of weak convergence) is called the asymptotic ratio distribution of the family. 

Our next main result describes the support and the density of the asymptotic 
ratio distribution associated with sequences of rational functions constructed by 
using our approximation scheme. 

Theorem 4. Let f{z) be an algebraic function defined by (|1.1() and fix a generic 
initial k-tuple of rational functions IN = {go(-z), ■ • ■ , <lk-i{z)\- If denotes the 
residue distribution ofrn{z) and v = lim„^oo is the asymptotic ratio distribution 
of the family {qniz)}neN then the following holds: 

(i) The support of v does not depend on the set of slow growth T,fjj^. More 
precisely, the asymptotic ratio distribution v vanishes in a sufficiently small 
neighborhood of each point in S/jjv- 

(ii) Suppose that there exists a nonisolated point zq e such that equation 
(|1.2|l considered at zq has the property that among its roots with maximal 
absolute value there are at least two with the same maximal multiplicity 
(compare with Definition AKA below). If the sequence {?'„(zo)}rteN diverges 
then supp v = 'E.f. 

(iii) One has 

1 dydom I X ( dvi^) 

V = ^=> ydom[Z) ^ 

TT OZ 
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As a consequence of Theorems [JJEl we obtain the following asymptotic classifi- 
cation of the set of poles of the rational approximants {?'„(z)}„gN (see Figure^: 

Corollary 1. For any algebraic function f{z) and any initial k-tuple IN of rational 
functions the set of all poles of the family {r„(z)}„gN splits asymptotically into the 
following three types: 

(i) The fixed part consisting of a subset ofTfjj^i \ (S/ U S/jjv); 

(ii) The regular part tending asymptotically to the finite union of curves Sj, 
that is, the induced equimodular discriminant of (|1.2() ( cf. Theorem 0); 

(iii) The spurious part tending to the (finite) set of slow growth Sjjjv- 

The usage of the name "spurious" in Corollary ^ (iii) follows the conventional 
way of describing poles of rational approximants to / that do not correspond to 
analytic properties of the function /. This terminology has been widely adopted 
in Pade approximation, see e.g. j34| . The rigorous definition of spurious is rather 
technical, which is why we postpone it until §3 below. 




-3 -2-10 1 -3 -2-10 1 



Figure 1 . Poles of r^i (z) approximating the branch with maximal 
absolute value of the algebraic function f{z) with defining equation 
(z + l)y^ = {z^ + + {z- 5I)y + {z^ - 1 - I) for two different 
choices of initial triples. 

Explanations to Figure^ The two pictures show the poles of r3i{z) for the initial 
triples p_2(^) = P-iiz) = 0,poiz) = 1 andp_2(z) = z^+Iz'^-5, p-i{z) = z^-z+I, 
Pq{z) = 1, respectively (the second initial triple was picked randomly). The large 
fat point in these pictures is the unique pole z = — 1 of / which in this case coincides 
with TfjN since the chosen initial triples are polynomial. The smaller fat points 
on both pictures are the branching points of f{z). One can show that the curve 
segments belonging to S/ can only end at these branching points, see |TD] and |12| . 
Note that the poles form the same pattern on both pictures. This pattern is close to 
the curve segments constituting S j and contains an additional nine isolated points 
in the right picture. These nine isolated points form the set of slow growth in this 
case; one can also check numerically that the latter set is indeed a subset of the 
finite discriminantal set S described in the proof of Lemma 

The final result of the paper settles a natural analog of Nuttall's Conjecture (see 
§3 and |34j ) for our approximation scheme: 

Theorem 5. For any algebraic function f{z) and any initial k-tuple IN of rational 
functions there exists a finite upper bound for the total number of spurious poles of 
the approximants {?'n(z)}nGN associated with the data (f,IN). 
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As it was pointed out in IHJ, there is strong evidence that there always exists 
some infinite subsequence of the sequence of diagonal Pade approximants to a given 
analytic function for which there are no spurious poles in a certain convergence 
domain. It is interesting to note that for the standard choice of initial /c-tuple IN = 
{0,0, ... ,0, 1} in our approximation scheme none of the rational functions in the 
resulting sequence of approximants {r„(z)}„£pf has spurious poles, see Corollary |2| 

Let us conclude this introduction with a few remarks on the approximation 
scheme proposed above and some related topics. In the present setting the pointwise 
convergence of the sequence of rational approximants {r„(z)}„gN follows almost 
immediately from Theorem ^ (cf. Lemma |S1 below) . This is simply because our 
main recursion formula H1.3|l may actually be viewed as a recurrence relation with 
constant coefficients depending algebraically on a complex variable z. However, in 
many important cases one has to invoke quite different arguments in order to derive 
similar (pointwise) convergence results. For instance, the major challenge in the 
theory of finite difference equations is related to recurrence relations with varying 
coefficients. In this case the fundamental theorems of H. Poincare and O. Perron 
(see, e.g., ^1 pp. 287-298]) serve as prototypes for many new developments. Some 
interesting generalizations of the Poincare and Van Vleck theorems were recently 
obtained in |14l Typical applications of this sort of results pertain to the 

ratio asymptotics of various kinds of orthogonal polynomials and may be found in 
the well-known paper ^7\. We refer to and [3| for more recent results in this 
direction. In particular, in loc. cit. the authors study the asymptotics of the zeros 
of a family of polynomials satisfying a usual 3-term recurrence relation with varying 
and stabilizing complex coefficients and prove that these zeros concentrate on an 
interval in the complex plane. We continue this line of research in a forthcoming 
publication |12| where we study the asymptotics of general families of functions 
satisfying functional recurrence relations with varying and stabilizing coefficients 
and obtain appropriate extensions of Poincare's theorem. 

This paper is organized as follows. In §2 we give all the proofs, §3 contains a 
short description of Pade approximants and their spurious poles while §4 is devoted 
to proving the 3-conjecture of Egecioglu, Redmond and Ryavec. Finally, in §5 we 
compare our approximation scheme with Pade approximation and discuss a number 
of related topics and open problems. 

Acknowledgements. Communication with Alan Sokal has been of great help in 
relating our results to those of [5j . The authors are obliged to the anonymous 
referees for their constructive comments and for pointing out several references. 



2. Proofs and further consequences 

In order to prove Theorem|21and Theorem|31we need several preliminary results. 
Let us first fix the following notation: 

Notation 1. Let be a domain in C and T : 17 9 z T{z) — {tij(z)) € 
Mfc(C) be a fc X fc matrix-valued map. Denote by Xt ■ 3 z i-^ Xt(Ci ^) ^ ^[C] 
the characteristic polynomial map associated with T, that is, for each z £ fl the 
characteristic polynomial of T{z) is given by the polynomial Xt (Ci iii the variable 
^ with coefficients which are complex- valued functions of z. Let further At C 
be the discriminant surface associated with T, i.e., the set of all z e for which 
the (usual) discriminant of the polynomial (C, z) vanishes. Finally, let denote 
the equimodular discriminant associated with T, that is, the induced equimodular 
discriminant of the family of polynomials {xt(Cj^) | 2: G il} (cf. Definitional). 
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The following lemma is a consequence of well-known results on the diagonal- 
ization of analytic matrices which are based essentially on the implicit function 
theorem (see ^ p. 106]). 

Lemma 1. If the map T is analytic in then the eigenvalues Xi{z), 1 <i < k, of 
T{z) are analytic functions in n\ At- Furthermore, there exists an analytic map 
A : 17 \ At 9 2 i-> A{z) e GLfc(C) such that 

T{z) = A{z) diag(Ai(z), . . . , Xk{z))A{z)-\ z e 17 \ At- 

The map A{z) is uniquely determined in n\ At up to a nonvanishing complex- 
valued analytic function 7(2;). Moreover, its column vectors A{z)''^\ . . . , ^(z)^'^^ are 
eigenvectors ofT(z) with eigenvalues Xi(z), . . . , Xk{z), respectively, while the row 
vectors of its inverse v4(z)^^, . . . ,^(z)j^^ are eigenvectors of the transpose T(zy 
with eigenvalues Ai(z), . . . , Xk{z), respectively. 

Lemma 2. Under the assumptions of Lemma^ the following holds: 

(i) For any x = (a;A;_i, . . . , a;o)* G C*^ there exist complex-valued functions 
aij{z,x.), 1 < i, j < k, which are analytic m f2 \ At such that 

(\ t 
k k 
Y,a,,{z,^)X,{zr,...,Y,ak,{z,^)X,{zr 

for z e VL \ At and n g N. 

(ii) For z G \ let X{z) ~ Xi{z) denote the dominant eigenvalue of T{z). 
There exist ^J' -valued analytic functions u(z) = (ui(z), . . . , Ufc(z))* and 
v(z) = {vi{z), . . . ,Vk{z)Y defined on ft \ At such that 

v(z)*u(z) = 1 for z e fl \ At and 

TizY . (2-2) 

^ ' =^ u(z)v(z)* inVl \ {At U St) as n ^ 00. 

A[Z) 

Thus I A(z j" } sequence of analytic matrices which converges locally 

uniformly in fl\ {At U St) to u(z)v(z)* in the sup-norm on Mk{C). 

(iii) In the notations of (i) and (ii) one has 

k 

a^l{z,-l^) = Ui{z)'^Vj{z)xk^j, l<i<k, (2.3) 

for any x = {xu-i, . . . , Xq)* G . 

Proof. Part (i) is a direct consequence of Lemma^ To prove (ii) recall the notations 
of Lemman^iid set 'vl{z) — A(z)'^^^ and '^{z) = A(z)^^, so that v(z)*u(2;) = 1 for 
all z G \ At ■ From Lemma ^ one gets 

^w"-„,),..,rjMi)v,...,rA#)") (2.4, 



A(z)" ' ' ='V ' V K'A ) '■■■'V 

for all 2; G \ (At U St) and n G N, which immediately proves (|2.2|) since by 
assumption |Aj(z)| < |A(z)| if 2 < j < fc. Finally, H2.3I) follows from (i) and (ii) by 
elementary computations. □ 

Recall now the polynomials /i(z), < i < fc, from 1)1. 1|1 and define the rational 
functions 

R^{z) = ~^^ 0<i<fc-l. (2.5) 
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Choosing any initial fc-tuplc of rational functions IN = {qo{z), . . . ,qk~i{z)} one 
can rewrite recursion as 



Qn{z) = Rk-i(z)qn-i(z) for n > fc 



(2.6) 



1=1 

or, equivalently, 

q„(z) = T(z)""''+^qfc_i(z) for n>k, where 

q.Tn{z) = {qm{z),qm-i{z), ■ ■ ■ ,qm-k+i{z)Y G C'' for m > fc - 1 and 
(Rk-i{z) Rk-2{z) Rk-3iz) ... Ro{z)\ 



T{z) 



\ 



(2.7) 



...10/ 

We now apply Lemmas ^ and [3 to this specific case. It is easy to see that the 
characteristic polynomial of T{z) is given by 

fc-i 

x,(C, ■A = e-Y. Rk-^-l{z)C'-'-\ (2.8) 

j=0 

Moreover, in view of Theorem [SJ Notation ^ and Lemma [3 (ii) one has 

ydom{z) = A(z) for z e C \ (T/jAT U S/), where 5/ = St- (2.9) 

Remark 3. Identity 12.9|l implies in particular that ydom(z) is a nonvanishing ana- 
lytic function in C \ (T/jjv U S^). 

Remark 4. The discriminant At associated with the map T{z) given by H2.7|l is a 
finite subset of C since all the entries of T{z) are rational functions of z (cf. H2.5(l V 

It turns out that in our specific case the vector-valued functions u(z) and v(z) 
defined in Lemma [21 (ii) can be computed explicitly: 

Lemma 3. For z G C \ (T fj^ US/) let u(z) and v(z) be eigenvectors with eigen- 
value X{z) of T{z) and T{zY , respectively. There exist complex-valued functions 
li{z),^2{z) which are analytic and nonvanishing in C \ (TfjN U Sj) such that 

u{z) = 7i(z)(mi(z), . . . , Uk{z)y, where Ui{z) = X{z)''^\ I < i < k, 

k 

v(z) = 72(z)(wi(z), . . .,Vk{z))\ where Vi{z) = ^ Rk^j{z)X{zy^\ 1 <i <k. 

In particular, the maps u(z), -v(z) and u(z)v(z)* are all analytic m C\(Tjj7vUSj). 
Moreover, the functions 71 (z), 72(2) may be chosen so that v(z)*u(z) = 1 for all 
zeC\{TfjNUEf). 

Proof. Note that u(z) and v(z) are analytic C'^-valued functions in C\ (T/jjyUS/) 
since A(z) is a simple eigenvalue of T(z) (and r(z)*) for all z in this set (cf. the 
proof of Lemma The explicit forms of u(z) and v(z) are easily obtained by 
elementary manipulations with the matrix T{z) defined in (|2.7|l . Note also that 
since A(z) is a root of the characteristic polynomial XriCi z) of T{z) one has 



v(z)*u(z) =7i(z)72(z) ^(fc-™)i?™(z)A(z)™ =7i(^)72(^)A(z)|:X.(C,^) 



C = A(2) 



Therefore, if v(z)*u(z) = then A(z) must be a multiple root of the characteristic 
polynomial XriCiz) of T{z). However, this contradicts the assumption z G Sj. 
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Thus v(z)*u(z) ^ whenever z G Ef, which proves the last statement of the 
lemma. □ 

Henceforth we assume that the complex- valued functions 7i(z),72(^) defined in 
LemmaOlare chosen so that v(z)*u(z) = 1 for all z e C \ {T fjN U S/). 

Remark 5. By Remark |31 all coordinates of u(z) are nonvanishing (analytic) func- 
tions in C \ (T/jjv U Ef). 

Lemma 4. With the above notations one has 

""^^^ -li{z)^2iz)uiiz)J2v'^iz)qk-.iz) mCMTfjMUEf) (2.10) 



A(z)" 



i=l 



Proof. Note that in view of recursion (|2.6|) all the functions f^jpr, n £ N, are 
analytic in C \ (TyjAr U Sj) while Lemma 13 implies that the same is true for the 
function in the right-hand side of p. 10(1 . From Lemma El (i)-(iii) it follows that 

1^ ^ -fiiz)^2{z)ui{z) J2 <z)qk-^{z) in C \ (T;,,^ U % U At) (2.11) 

as n ^ oo. Since Ay is a finite subset of C (cf. Remark^J Cauchy's integral formula 
shows that the local uniform convergence stated in 12.11|l must actually hold even 
in an e- neighborhood of the set A^ \ (T/^^v U S/), which proves H2.10|l . □ 

To further simplify the notations set 

^ ^ i—l 

We can complement Lemma 0] with the following result: 

Lemma 5. Given Q < e <^ \ let Vc = CP \ V^, where Ve is the e-neighborhood 
ofTfjjst U in the spherical metric. The rate of convergence of gn{z) =| g{z) 
in Ve is exponential, that is, there exist constants £e and & (0, 1) such that 
\g„{z)-g{z)\ <€,p'', for allzeV,. 

Proof. By Lemma |21 (i) and (|2.4|) we get an exponential rate of convergence for 
gn{z) ^ g{z) in CP \ Ue, where 11^ is the e-neighborhood of T/jjv U U A^ in 
the spherical metric. Since gniz), n G N, and g{z) are all analytic functions in 
C \ (T fjN US/) and At is a finite subset of C we get the desired conclusion again 
by Cauchy's integral formula. □ 

The next lemma gives a complete description of the set of slow growth "SfjN 
that appears in Theorem [21(cf. Definitions 13 and : 

Lemma 6. Let IN — {qo^z), . . . , qk-i{z)} be a fixed initial k-tuple of rational 
functions as in 12.6|l . Then 



so that in particular |Sjjjv| < oo- 



k k 

A(z)^-^gfc_,(z)i?,_,(z) = ^ , (2.13) 

i=l j=i 
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Proof. The fact that 'Sf.iN is given by H2.13|l follows readily from Lemmas and 
Remark El In order to show that |I]y./Ar| < oo one can produce an explicit upper 
bound for |Sj_/jv| in the following way: set 

k k 
i=l j=i 

and let S{z) denote the resultant of the polynomials Xt (Cj ^) ^^^d fi^ ^„ (C, z) in the 
variable (. Let further S = {z Q C \ S{z) = 0}. Since S{z) is a rational function 
and ^fjN ^ 5 it follows that |E/jAr| < |iS| < oo. □ 

Both Theorem |5| and Theorem |31 are immediate consequences of Lemmas 
above. The same ideas yield also a proof of Theorem [S] Actually, we can produce 
an asymptotically optimal bound for the number of spurious poles of the rational 
functions r„(z). To do this let us introduce the following notation. 

Notation 2. Fix e > such that S/jtv H (T/jjv U S/)^ = 0, where (T/^jv U S/)^ 
denotes the e-neighborhood of /jyUSy in the spherical metric. Given a sequence 
of rational functions {g„(2)}neN as in H2.6|l denote by Z^P{qn) the set of zeros of Qn 
which are contained in C\(T/^/jvUS/)e. The total cardinality \\Z^P{qn)\\ of Z^P{qn) 
is the sum of the multiplicities of each of its elements as zeros of g„. The total 
cardinality of 'Sfjj^f is ||S/^/7v|| = J2z<^'Sf ™('^)' "^here m{z) is the multiplicity of 
z G S/jAT as a zero of the equation in H2.13|l . 

Lemma 7. Let IN — {qolz), . . . , qk-i{z)} be a fixed initial k-tuple of rational 
functions. In the above notation there exists N £ N such that ||Z|^'(g„)|| < ||E^j7v|| 
forn>N. Thus 

ll^f (g„)|| < maxdlZf (go)ll, . . . , ll^f (g^-i)IU|S/,/^||) 
for all n e N. 

Proof. Recall (|2.12ll and note that since |S/jjv| < oo it is enough to show that for 
any zq e '^fjN there exists Nq e N and ep > such that gn{z) has (at most) m{zo) 
zeros in {z e C I |z — zqI < £o} for n > Nq. But this is an immediate consequence 
of the uniform convergence established in Lemma ^ and Hurwitz's Theorem. □ 

An interesting consequence of the above results is that by using our scheme one 
can always find rational approximants {?'„(z)}„gN that have no spurious poles: 

Corollary 2. // / is an algebraic nonrational function and IN is chosen to be 
the standard initial k-tuple {0,0, . . . , 1} then S/./Af = 0. In particular, the ratio- 
nal approximants {r„(z)}„eN corresponding to the standard initial k-tuple have no 
spurious poles. 

Proof. Indeed, if E/jat 7^ and z g ^fjN then by H2.8|l and (j2.13(l the correspond- 
ing dominant root ydom{z) — X{z) has to satisfy 

fc-l k 

Xiz)" - ^i?fc_,_i(z)A(z)^-'-i = and ^ (z)A(z)i-^" = 0, 

i=0 j=l 

which gives A(z) — 0. However, the latter identity cannot be fulfilled by the domi- 
nant root of (|1.3|l if / is an algebraic nonrational function. □ 

Remark 6. An analog of Corollary |21 for 3-term recurrence relations satisfied by 
certain biorthogonal polynomials was proved in TJ, §3]. 

Remark 7. Corollary[21does not hold for arbitrarily chosen initial fc-tuples consisting 
of constant functions, as one can easily see by considering e.g. the initial fc-tuple 
/iV = {l,0,...,l}. 
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For the proof of Theorem 0] we need some additional notation and results. 

Notation 3. Let D\tCk be the fc-dimensional complex linear space consisting of 
all (fc + l)-term recurrence relations with constant coefficients of the form H1.4|l . 
We denote by the fc-dimensional complex linear space of all initial fc-tuples 

{uq, ...,Uk-l). 

Recall the notion of recursion of nondominant type introduced in Definition [21 

Definition 10. A nondominant recurrence relation in fHeCfc with initial fc-tuple 
IN e 3D1fc is said to be of subdominant type if the following conditions are satisfied. 
Let Ti, . . . , Ts, s < fc, denote all distinct spectral numbers with maximal absolute 
value and assume that they have multiplicities rrii, . . . , mg, respectively. Then there 
exists a unique index iq G {1, . . . , s} such that nii < rriig for i G {1, . . . , s}\ {«o} and 
the initial fc-tuple IN is fast growing in the sense that the degree of the polynomial 
Pig in (|1.6() corresponding to r^g is precisely rriig — 1. The number t^q is called the 
dominant spectral number of this recurrence relation. 

The following lemma is a simple consequence of Theorem ^ 

Lemma 8. In the above notation the following is true: 

(i) The set of all slowly growing initial k-tuples with respect to a given dominant 
recurrence relation in IHeCfc is a complex hyperplane SQk in 301^. The set 
SQk is called the hyperplane of slow growth. 

(ii) For any dominant recurrence relation in d\eCk and any fast growing initial 
k-tuple in J^k the limit lim„^oo exists and coincides with the domi- 
nant spectral number Tmax , that is, the ( unique ) root of the characteristic 
equation (|1.5(l with maximal absolute value. 

(iii) Given a nondominant recurrence relation of subdominant type in 9^eCfc and 
a fast growing initial k-tuple in the limit lim„^oo "^^^ exists and co- 
incides with the dominant spectral number. 

(iv) For any nondominant recurrence relation in 9^ec/c which is not of subdomi- 
nant type the set of initial k-tuples for which lim„_>oo ^^^7^ exists is a union 
of complex subspaces of 3^k of positive codimensions. This union is called 
the exceptional variety. 

Proof. In order to prove (i) notice that the coefficient Umax in Definition |21 is a 
nontrivial linear combination of the entries of the initial fc-tuple {uq, . ■ . , Ufe-i} with 
coefficients depending on ai, . . . , afc. Therefore, the condition Kmax = determines 
a complex hyperplane SQk in INk. One can easily see that the hyperplane of slow 
growth is the direct sum of all Jordan blocks corresponding to the spectral numbers 
of a given recurrence (|1.4|l other than the leading one. 

The assumptions of part (ii) together with \i.<6\ yield m„ = ^maxT^^ax + • • ■ for 
n G Z+ , where the dots stand for the remaining terms in H1.6() corresponding to the 
spectral numbers whose absolute values are strictly smaller than |T„jaa;|. Therefore, 
the quotient "^^^ has a limit as n — > cxd and this limit coincides with Tmax, as 
required. By definition Tmax is a root of (|1.5(l . which completes the proof of (ii). 
This last step can alternatively be carried out by dividing both sides of H1.4|l by 
Un~k and then letting n 00. In view of Definition 1101 the same arguments show 
that the assertion in (iii) is true as well. 

For the proof of (iv) we proceed as follows. Take any nondominant recurrence 
relation of the form (|1.4|l and let ri, . . . , r,., r < fc, be all its distinct spectral numbers 
with maximal absolute value. Thus \Ti\ = \Tmax\ if and only 1 < z < r. Choose an 
initial fc-tuple IT = {uq, ■ • ■ j Wfc-i} and denote by pi, the polynomials in (|1.6|l 
corresponding to ti, . . . , t^, respectively, for the sequence {u„ | n G Z+j constructed 
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using the given recurrence with initial fc-tuple IT as above. Assuming, as we may, 
that our recurrence relation is nontrivial we get from H1.6(l that 7^ if 1 < i < r. 
We may further assume that the degrees c?i, . . . ,dr of the polynomials pi, ■■■,Pr, 
respectively, satisfy di > . . . > dr. A direct check analogous to the proof of part 
(ii) shows that if only the polynomial pi is nonvanishing then lim„^oo = ti. 
If r > 2 and s G {2, . . . , r} is such that pi, . . . ,ps are all nonvanishing polynomials 
among pi, . . . ,pr then using again 1)1. 6|l we get 

/ \ n+l / \ n+1 

Pi{n + l)+p2{n + l) (^) +...+p,(n + l) + o(l) 

Un+l _ \Tl J \TiJ 

Pi{n)+p2{n)(^^^ +...+p,{n)(^^^ + o(l) 

Since jfj"! = 1 and 7^ ti, 2 < i < s, it follows that if di = d2 then the expression 
in the right-hand side has no limit as n ^ 00. Therefore, if such a limit exists then 
di > d2, which gives us a complex subspace of INk of (positive) codimension equal 
to di ~ ^2 • Thus the exceptional variety is a union of complex subspaccs of INk of 
(in general) different codimensions. □ 

Proof of Theorem^ To prove the first statement notice that under the assump- 
tions of the theorem the dominant root of the asymptotic symbol equation (|1.2II 
is a well-defined analytic function in a sufficiently small neighborhood of Sjjjv- 
Therefore, the residue distribution (|1.7() associated to this dominant root vanishes 
in a neighborhood of "EfjN- Thus the set S/./at of slowly growing initial conditions 
can be deleted from the support of the asymptotic ratio distribution v. 

In order to show that under the nondegeneracy conditions stated in (ii) the sup- 
port of I' must coincide with it is enough to prove that the sequence {r„(z)}„gN 
diverges almost everywhere in . This is actually an immediate consequence of the 
fact that the nondegeneracy assumptions in (ii) imply that at almost every point 
z € 'E.f recursion p. 3(1 is neither of dominant nor of subdominant type (cf . Defini- 
tions El and . Indeed, since the set ^fjN of slow growth is finite it follows from 
Lemma |Hl(iv) that the sequence {rn{z)}neN diverges for almost all 2; S S/. 

Finally, in order to prove (iii) notice that by Definitions[7|-|Slany rational function 
r is the Cauchy transform of its residue distribution and that by Theorem[5]one 
has r„(z) — ^" '^^^^^■^ ydom{z) in C\Vf. Therefore, if denotes the residue 
distribution of r„ then the Cauchy transform of v := lim„^oo equals ydom- 
It then follows from the standard relation between a distribution and its Cauchy 
transform (see (ll.8|l l that 

1 dC, 1 dya om 

V = —- 



TT dz TT dz ' 

which completes the proof of the theorem. □ 

Proof of Corollary Q The assertion is a direct consequence of the previous results 
since Theorem |3 implies that for any sufhciently small e > the complement 0^ of 
the e- neighborhood of I?/ = U T/jjv U E/^at in the spherical metric contains 
no poles of r„(z) for sufficiently large n. Note in particular that by Theorem 0] (iii) 
the regular part of the poles of the family {r„(z)}„gN must tend to in such a 
way that lim I'n = = ^ VdorAz) ^ ^ 

3. A BRIEF EXCURSION AROUND PADE APPROXIMATION 

In order to make this paper self-contained and to be able to compare our approxi- 
mation scheme with the classical Pade scheme we give in this section a brief account 
of Pade approximants. For a detailed overview of this subject we refer to e.g. |2] 
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and pi]. As in §1 we denote by Vk the linear space of all complex polynomials of 
degree at most k. 

Definition 11. Let f{z) be a function analytic at infinity with power series ex- 
pansion f{z) = '^°^QCiZ~^. For each pair {ra,n) G there exist two polynomials 
Pni.n e Vra and q^.n & Vn \ {0} such that 

qrn,n ( -) f (z) ~ p,„,„ ( -) = 0(z-™-"-l) (3.1) 



as z — > oo. The rational function 



is called the (m,n)-Pade approximant to the function f{z) (expanded at oo). 

The notation introduced in Definition 1111 will be used throughout this section. 
Fade approximants can be seen as a generalization of Taylor polynomials to the field 
of rational functions. Note that in view of (|3.1(l these approximants are well-defined 
and exist uniquely for any function f[z) that has a power series expansion at oo. 
Fade approximants are also closely related to (generalized) orthogonal polynomials. 
As we shall now explain, this connection is especially simple in the case of diagonal 
Pade approximants, that is, approximants of the form [rt/ri](z). To this end let us 
define the reverse denominator polynomial Qn of (7„_„ by 

Qn{z) = z"qn,n Q 

The following results may be found in IMI Lemma 2.2]. 

Lemma 9. A polynomial qn.n € 'Pn \ {0} is the denominator of the Pade approx- 
imant [n/n] to f if, and only if, the reverse polynomial Qn of qn,n satisfies the 
(generalized) orthogonality relation 

CQniOfiOdC^O, {0,...,n-l}, (3.3) 

c 

where C is a closed contour containing all the singularities of f{z) (so that f{z) is 
analytic on C and in its exterior). 

Condition (|3.3|l is a (generalized) orthogonality relation for the polynomial family 
{Qn{z)}neN on the contour C with respect to the complex-valued measure /(C)dC- 
The fact that the quadratic form 

ig,h)f -.^ f giCMOfiOdC 



c 

diagonalizcd by the family {Qn(-z)}rieN is in general neither Hermitian nor positive- 
definite accounts for the rather chaotic behavior of the zero loci of the polynomials 
Qn{z), ri e N, see and the results below. 

Remark 8. Note that the generalized orthogonality relation H3.3(l and Favard's the- 
orem imply that if the polynomials in the family {Qn{z)}neN are assumed to be 
monic then they satisfy a three-term recurrence relation of the form 

zQn{z) = Qn+liz) + anQ„{z) + PnQn-liz) (3.4) 

with complex- valued coefficients a„ and /3„ , see |16| . Apparently, the chaotic behav- 
ior of the zero loci of the polynomials Qn{z), n e N, is inherited by the families of 
coefficients {a„}„gN and {/3n}nGN, as these do not necessarily converge, see §5.2.3. 
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Probably the most important general theorem that applies to functions mero- 
morphic in the complex plane is that of Nuttall-Pommercnke asserting that if / 
is meromorphic in C and analytic at oo then the sequence of diagonal Fade ap- 
proximants converges to f{z) in planar measure (cf. 0001). The 

convergence of the diagonal Fade approximants to /(z) in the case when f{z) is a 
(multi- valued) algebraic function was studied in detail in |30| . As pointed out in 
loc. cit., the following important result was proved already in jl7| . 

Theorem 6. Let f{z) be a locally meromorphic function in some domain CP^ \ E, 
where E is a compact subset of C with cap(-B) = 0. There exists a domain Df C 
CF^ such that the sequence of diagonal Fade approximants {[n/n]{z)}neN converges 
in capacity to f{z) in Df but it does not converge in capacity to f{z) in any domain 
D C CP^ with cap{D \D)>0. 

The domain Dj in Theorem|S|is uniquely determined up to a set of zero capacity 
and is called the convergence domain of f{z). (Concerning the notion of (logarith- 
mic) capacity and convergence in capacity one may consult e.g. [SEl-) The following 
improvement of Theorem was obtained in |^ by building on the results of IH5| . 

Theorem 7. Let f{z) be a locally meromorphic function in some domain CP^ \ E, 
where E is a compact subset of <C with cap(i?) = 0. The convergence domain Df 
of f{z) is uniquely determined (up to a set of capacity zero) by the following two 
conditions: 

(i) Df is a subdomain o/CP""^ containing oo such that f{z) has a single-valued 
meromorphic continuation throughout Df; 

(ii) cap([CP^ \ Z?/]^""") = inf^ cap([CP"'" \ -D]^"'"), where the infimum ranges over 
all domains D C CP""^ satisfying (i). (Here for a given set S C CP^ we 
denote by the set of all z S CP such that z"^ g S .) 

In other words, Theorem |3 asserts that the convergence domain Df of /(z) is 
characterized by the property that its boundary dD f has minimal capacity. 

Definition 12. Given a function /(z) analytic near cxd we say that a compact set 
K C CP is admissible for f(z) if f(z) has a single-valued continuation throughout 
CP^ \ K. 

Using the terminology introduced in Definition 1121 one may rephrase Theorem 
13 as follows: given a locally meromorphic function /(z) in some domain CP^ \ E, 
where is a compact subset of C with cap(i?) = 0, there exists an unique admissible 
compact set Ki^ C CP"'^ such that 

cap(iiro) = inf cap(_ft'), 

where the infimum is taken over all admissible compact sets K C CP'^. 

Let goiz, w) denote the Green's function of a domain D C CP^ with ca.p{dD) > 
0. The following result may be found in j85l Fart III, Theorem 1 and Lemma 5]. 

Theorem 8. In the above notation the minimal set Kq is the union of a compact 
set Eq with analytic Jordan arcs. The set Eq has the property that the analytic con- 
tinuation of f{z) in CF^ \Kq has a singularity at every boundary point of Eq while 
the analytic arcs in Kq \ Eq are trajectories of the quadratic differential Tf{z)dz'^ 
given by 

^/(^) = |^£'Cpi\i^o(^'Oo)| • 
Moreover, if f{z) is algebraic then Tf[z) is a rational function. 
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Theorem 9. Let f{z) be an algebraic nonrational function which is analytic near 
oo. If all singularities of f{z) are contained in a set W = {wi, • . • ,Wti} C C then 
there exist a subset W' of W with \W'\ — n' > 2 and a set Z d C with (not 
necessarily distinct) n' — 2 points such that 

i-i-wjew'[z — Wj) {oz 

Here c is an appropriate complex constant and z G CP^ \-?^0; where Kq is the union 
of trajectories of the quadratic differential Tf{z)dz'^ and a set of isolated points 
contained in W . 

Theorem |51 shows that most of the poles of the diagonal Pade approximants 
tend to Kq when n oo. However, not all of them do! Indeed, even in 
the case when /(z) is algebraic one cannot hope for a better convergence type than 
convergence in capacity (which is only slightly stronger than convergence almost 
everywhere with respect to Lebesgue measure). As shown by the following example 
of H. Stahl, uniform convergence in CP^ \ Kq fails in a rather dramatic way. 

Example 2. Consider the function 

, (2 — cos7rai)(z — C0S7rQ;2) / n 

j(z) — r^=^= z + (cosTrai + cos7ra2j, 

Vz^ — 1 

where 1, ai, are rationally independent numbers. Then the reverse denominators 
{_Qn{z)\n<3i Satisfy the generalized orthogonality relation 

(a;-cos7rai)(a;-cos7ra2) , , 
Qn(x)Qm(x) -j^=^= dx = tor TO 7^ n. 

-1 TTV 1 — X^ 

Even though each polynomial Qni^z) has at most two zeros outside [—1, 1], the zeros 
of Qn iz) cluster everywhere in C as n — *■ 00 (see ISI Example 2.4]). 

A naive definition of spurious poles of Pade approximants would be that these 
are the poles which are not located near Kq. A more rigorous definition is as follows 
(cf. PI Definition 4.1]). 

Definition 13. Let / be a function satisfying the conditions of Definition 1111 
Let further A/" C N be an infinite sequence and {[n/n\}n£j\f be the corresponding 
subsequence of diagonal Pade approximants to /. We define spurious poles in two 
different situations: 

(i) Assume that for each n £ Af the approximant [n/n] has a pole at z„ e CP^ 
such that Zn zq as n 00, n Cz M . If / is analytic at zq and the 
approximants [n/n\n^j^ converge in capacity to / in some neighborhood 
of zq then the poles of the approximants [n/n] at z„, n G A/", are called 
spurious. In case zq = 00 the convergence z„ zq has to be understood in 
the spherical metric. 

(ii) Let the function f{z) have a pole of order fco at zq and assume that for each 
n (z JV the total order of poles of the approximant [n/n] near zq is ki = 
fcijTt > ^0- Assume further that [n/n] has poles at z„ j, j e {1, . . . ,TO„}, 
of total order fci^„ and that for any selection of j„ G {!,... ,m-n} one has 
Zn,jri Zq as n ~* 00, n Cz M . Then poles of order fci — fco out of all poles 
of the approximants [n/n] near zq, n Cz M, are called spurious. 

With this definition we can finally formulate the Baker-Gammel- Wills (Pade) 
Conjecture and some related questions. 

Pade (Baker-Gammel- Wills) Conjecture. // the function f is meromorphic 
in the unit disk D and analytic at 0, and if {[n/n]}neN denotes the sequence of Pade 
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approximants to f expanded at the origin, then there exists an infinite subsequence 
J\f QN such that [n/n]{z) =t /(z) m D \ {poles of f\ as n ~* oo, n e A/". 

The Pade Conjecture is widely regarded as the main chaUenge in the theory 
of Pade approximation. The importance of this conjecture is due to the homo- 
graphic invariance property of diagonal Pade approximants and its many conse- 
quences, see |2- A major breakthrough was recently made by D. Lubinsky 
who showed that the general version of the Pade Conjecture stated above is false. 
Subsequently, V. Buslaev TS* constructed counterexamples to this conjecture for 
some special types of algebraic (hyperelliptic) functions. However, the important 
question whether the Pade Conjecture could hold for certain classes of algebraic 
functions is still open. A key step in this direction would be a deeper understand- 
ing of the asymptotic distribution of spurious poles of diagonal Pade approximants. 
To this end J. Nuttall proposed the following conjecture (cf. 34 ): 

Nuttall's Conjecture. Let f be an algebraic function which is analytic at oo. 
Then there exists an upper bound for the number of spurious poles (in the sense of 
total order) for all Pade approximants [n/n], n € N. 

Note that Theorems |21 and |S1 show that natural analogs of both the Pade Con- 
jecture and Nuttall's Conjecture are always true for the rational approximants to 
algebraic functions constructed by using our approximation scheme. 

4. Application: The 3-conjecture of Egecioglu, Redmond and Ryavec 

The following interesting conjecture appeared in IS - where it was nicknamed 
the 3-conjecture - in connection with counting problems for various combinatorial 
objects such as alternating sign matrices with vertical symmetries and rt-tuples of 
nonintersecting lattice paths. 

Conjecture 1. All polynomials belonging to the sequence {pn{z)}ni£N defined by 
the A-term recursion 



have real zeros if and only if C >Z. Moreover, if C > i then the zeros of pn+i{z) 
and Pn{z) are interlacing for all n g N. 

Theorem 1 of proves the sufficiency part as well as the last statement in the 
above conjecture. We shall now fully settle the 3-conjecture by showing that C > 3 
is indeed a necessary condition for the validity of this conjecture. 

Proposition 1. // C < 3 and {pn{z)}n&i is th^ sequence of polynomials defined 
by recursion (14. 1|) then there exists N € N such that not all zeros of pm{z) are real. 

The proof of Proposition^is based on the ideas developed in the previous sections 
and goes as follows. We study the behavior of the branching points of the curve 
Tq given by solutions t of the asymptotic symbol equation 



considered as a branched covering of the z-plane (cf. Definition More precisely, 
we show that for any C < 3 the curve Fc has two complex conjugate branching 
points and the induced equimodular discriminant Sc of (|4.2|l is a real semialgebraic 
curve ending at these branching points (cf. Therefore, if e > there exists 

some sufficiently large A^ e N such that for all n > A^ the polynomial Pn{z) has 
(complex conjugate) roots lying in the e-neighborhoods of these branching points. 



Pn{z) = zpn-l{z) ~ Cpn-l{z) - p„_3(z), whcrC 

P-2{z) = p-i{z) = 0, po{z) = 1 and C G R, 



(4.1) 



t^ - zt'^ + Ct + 1 = 



(4.2) 
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Figure 2. Zeros of ^41(2) satisfying (|4.1|l for C = 4, 3, 1, -1, -2. 



Explanations to Figure The five pictures show the qualitative changes of the 
induced equimodular discriminant 'E.c and the branching points of the curve Tc 
when the parameter C runs from 00 to ~oo. The fat points are the branching points 
of Fc while the thin points illustrate the zeros of the polynomial p4i(z). The first 
picture depicts the typical behavior for C > 3. In this case the branching points of 
Tc are all real and the equimodular discriminant 'E.c is the segment joining the two 
rightmost branching points. If C = 3 then the two leftmost branching points of Tc 
coalesce. For — 1 < C < 3 there are one real and two complex conjugate branching 
points of Tc and the induced equimodular discriminant Sc is the F-shaped figure 
joining these three points. If C = — 1 then the triple point occurring in the previous 
case coincides with the (unique) real branching point of Tc- Finally, if C < — 1 
then one has a stable configuration consisting of the equimodular discriminant Sc 
joining the two complex conjugate branching points of Tc and the remaining (real) 
branching point of Tc lying outside S^. 

Straightforward computations using Mathematica"'"^ yield the following result: 

Lemma 10. The branching points ofTc satisfy the equation 

Az^ + C^z^ - 18Cz - 27 - 4C^ = 0. (4.3) 

The discriminant of the polynomial in z occurring in the left-hand side of (|4.3|l is 
given by 

6A{C ~3f{C^ + 3C + 9f. (4.4) 

Lemma 11. The behavior of the induced equimodular discriminant near a 
branching point of Tc changes if and only if the branching point becomes degen- 
erate, i.e., the absolute value of the double root of the asymptotic symbol equation 
corresponding to the branching point coincides with the absolute value of the re- 
maining root. This happens precisely for those values of C for which there exist 
z, T, 9 such that the relation 

t^ - zt^ + Ct+1 = {t-rfit-Tc'^) (4.5) 

is satisfied for all t (z C If C is real this occurs only for C — 3 and C — —I. 

Proof. The first part of the lemma is obvious and can be found in e.g. |10| . To 
prove the statement about the values of C consider the system for the coefficients 
of 

z =T(2 + e**) 
C = r2(l + 2e'^). 
1 = -T^e''^ 
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The last equation gives t = -e*^ e+2fc7r)/3^ where k e {0, 1,2}. Substituting this 
expression in the second equation one gets 

(J ^ g-24(e+4fc7r)/3 _|_ 2g«(e+4fc7r)/3 

Elementary computations now show that C is real if and only if 9 ~ (3m — 4fc)7r 
for some m g Z, which gives either C = 3 (in which case z — —3 and r = — 1) or 
C = — 1 (in which case z = 1 and t = 1). □ 

Proof of Proposition Let us first show that it suffices to check that the following 
three conditions hold for any value of C G (— oo, 3): 

(i) the curve Tq has two (distinct) complex conjugate branching points; 

(ii) at each of these two complex branching points of Tc the absolute value 
of the double root of the corresponding asymptotic symbol equation (|4.2(l 
exceeds the absolute value of the remaining root; 

(iii) at a generic point zq S Sc the roots of the asymptotic symbol equation (|4.2II 
with maximal absolute value have distinct arguments and the sequence 
I Pn{zo) I diverges. 

Indeed, if (i) and (ii) are true then the induced equimodular discriminant Sc must 
necessarily pass through (actually, end at) the two complex conjugate branching 
points of Tc (cf. jJJJEj). In particular, there exist some nonisolated point ( e Sc 
and e > such that {z e C | |z - CI < e} C C \ E. Moreover, if (iii) holds then 
Theorem 0] (ii) implies that Sc" — supp v, where v denotes the asymptotic ratio 
distribution of the family {pn{z)}neti- Therefore, there exists some sufficiently 
large N G N such that for all n > TV the polynomial Pn{z) has at least one root in 
{z e C I |z - CI < e} C C \ K, as required. 

To verify conditions (i)-(iii) note first that if C < 3 then (|4.4|l implies that all 
three branching points of Tc are necessarily distinct. Since (|4.3|l is a polynomial 
equation with real coefhcients depending continuously on C it is enough to check 
that H4.3(l has (distinct) complex conjugate roots for an arbitrarily chosen value of 
C in the interval (— oo,3). This is easily done numerically for e.g. C = 0. Notice 
next that by using a similar continuity argument together with Lemma 1111 it is 
enough to verify condition (ii) for C = — 1 and for arbitrarily chosen values of C in 
the intervals (— oo,— 1) and (—1,3), respectively. This is again an easy numerical 
test for e.g. C = —1, C = and C — 2. Finally, in order to check (iii) note 
that at any point zq S Sc which is not a branching point of Tq the roots of the 
asymptotic symbol equation 1)4. 2|l with maximal absolute value have necessarily 
distinct arguments since these roots must be distinct. Fix any such zq and denote 
by ri(zo), 1 < i < 3, the (distinct) zeros of the corresponding asymptotic symbol 
equation (|4.2() . so that Ti(zo) 7^ for 1 < i < 3. By H4.1|l and H1.6|l there exist 
complex constants Ki{zo), I < i <3, such that 

3 

Pnizo) = Y,K,{zo)n(zor forne {-2,-l,0}UN. 

i=l 

Using the initial values p_2(z) = p_i(z) = and poiz) = 1 together with the above 
formula for Pn(zo) one can easily check that ifi(zo) 7^ for 1 < i < 3. Assume that 
|n(zo)| = |t2(zo)| > |t3(zo)|. Then 

A-.M.A-,,„)(^)%.3(.)(^)' 

" '"~\,,..) + A,m(=M)""' + A-3„.)(-M)'-' 

\Tl[Z0)J \Tl[Z0)J 



Pn{zo) , X 
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and since 



— 1 and T2{zo) ^ ti(zo) it follows that the right-hand side of the 

Ti{zoj 

above identity has no limit as n — )■ oo, which proves (iii). □ 



5. Comparison of approximation schemes and open problems 

5.1. Approximation via the algebraic scheme versus Pade approximation. 

As we already pointed out in the introduction, the usual Pade approximation and 
the approximation scheme introduced in this paper have an essentially different 
range of applications. Nevertheless, when applied to an arbitrarily given algebraic 
function / whose defining equation is known the scheme for approximating the 
branch with maximal modulus of / proposed above has a number of advantages 
compared to the usual Pade approximation scheme for /: 

(1) While Pade approximation requires the knowledge of the Taylor expansion 
at oo of /(z), our scheme uses only the defining algebraic equation for f{z). 

(2) The regular poles of Pade approximants concentrate on the union of cer- 
tain trajectories of the quadratic differential described in §3. While it is 
in general rather difficult to understand the structure of these trajectories, 
the induced equimodular discriminant S / has a more transparent definition 
and is also easier to study. Interesting examples of equimodular discrimi- 
nants relevant to the behavior of Potts models on sequences of finite graphs 
tending to certain infinite lattices have been studied in a number of papers 
by A. Sokal and his coauthors, see e.g. |ES1 and They also developed 
two approaches for finding explicitly. One is a more or less direct ap- 
proach while the other is based on a resultant type equation which can also 
be found in jlOj . 

(3) In general, Pade approximants have spurious poles with uncontrolled be- 
havior which makes uniform convergence impossible. Our scheme has only 
a finite number of spurious poles tending to a quite understandable finite 
set YifjN. In addition to that, the latter scheme has also a well-controlled 
(exponential) rate of convergence. 

(4) The denominators of the rational approximants {r„(z)}„gN constructed in 
this paper satisfy a simple recursion with fixed rational coefficients. By 
contrast, the denominators of Pade approximants satisfy a 3-term recur- 
rence relation with varying coefficients which are difficult to calculate and 
whose behavior is apparently rather chaotic (see §5.2.3 below). 

5.2. Related topics and open problems. In this section we discuss several ques- 
tions pertaining to our approximation scheme that seem to have important applica- 
tions not only to the convergence theory of Pade approximants (|Sni~|SSl) but also 
to complex Sturni-Liouville problems (0), modern statistical physics (' |28[I29| ') and 
the theory of (general) orthogonal polynomials f |16[ l26[|SHIl38p . 

5.2.1. Let {pniz)}neN be a family of monic complex polynomials. In addition to 
the asymptotic ratio distribution ly introduced in Definition El one can associate to 
this family an asymptotic probability measure fi defined as follows. 

Definition 14. To a degree dn complex polynomial Pn(-z) we associate a finite 
probability measure /x„ called the root counting measure of Pniz) by placing the 
mass at any of its simple roots. If some root of Pn{z) is multiple we place at 
this point a mass equal to the multiplicity of the root divided by dn- The limit 
lim„_>oo (if is exists in the sense of the weak convergence of measures) is called 
the asymptotic root counting measure of the family {pn(z)}neN and denoted by /x. 
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It is clear from Definitions |H1 and II4I that for any given polynomial family 
{pn{z)}ni£N the supports of ly and /i coincide. 

Problem 1. Find the relation between the asymptotic root counting measure fj, and 
the asymptotic ratio distribution ly for general polynomial families. Is it true that 
v depends only on fi, i.e., can two polynomial families with the same asymptotic 
root counting measure have different asymptotic ratio distributions? 

A more concrete question about the asymptotic root counting measure fj. may 
be formulated as follows. Take a sequence { ((3i,„(z), . . . , Qk,niz))} of /c-tuples 
of polynomials and consider the polynomial family {pniz)}neN satisfying the recur- 
rence relation 

k 
1=1 

with initial fc-tuple e.g. p^k+iiz) = p-k+2iz) = . . . = p_i(z) = 0, po{z) = 1. We say 
that the polynomial family {pniz)}nGN belongs to the (generalized) Nevai class if the 
sequence {{Qi,n{z), ■ ■ ■ ,Qk,n{z))}^^^j^ converges pointwise (and coefficientwise) to 

a fixed /c-tuple of polynomials { . . . , Qk{z)) } when n ^ oo. Recall from [^H] 
that the (ordinary) Nevai class of orthogonal polynomials consists of all polynomial 
families {pn{z)}nGfi satisfying a 3-term recursion of the form 

ZPn{z) = an+lPn+l{z) + bnPn{z) + 

where a„ > 0, 6„ G R for n G N and lim„_>oo an — a > Q, lim„^oo 6n = 6 G IR. 
Theorem 5.2 of |26| claims that the asymptotic root counting measure exists for 
any orthogonal polynomial family in Nevai's class and its density is given by ^{x) = 
— , , where a ~ b — 2a, [3 = b + 2a. A classical result of Szego states that 

in this case the density of v equals v{x) = -^X^ °'\ 

Problem 1'. Calculate the asymptotic root-counting measure /x (if it exists) for 
any polynomial family in the generalized Nevai class. 

Problem 1' is important and nontrivial already in the context of the present 
paper, namely for recurrences with fixed polynomial or rational coefficients. Quite 
recently we were informed by A. Sokal that the asymptotic root-counting measure 
/i is conjecturally given by the distributional derivative of log \ydom{z)\- 

5.2.2. Let us consider a finite recurrence relation with fixed polynomial coefficients 
and arbitrarily prescribed initial values of the form 

k 

Pn+l{z) =^Qi{z)pn^i{z). (5.1) 

4=1 

Denote by Sg the induced equimodular discriminant of the asymptotic symbol 
equation of H5.1|l viewed as a family of polynomials in the variable z (cf. Definitions^ 
and Q . The following fact was observed by the authors in extensive computer 
experiments and needs both a mathematical ground and a conceptual explanation. 

Conjecture 2. For any polynomial family {pn(-z)}neN satisfying a finite recurrence 
relation of the form (|5.1(l such that degp„(z) = n for n G N the zeros of any two 
consecutive polynomials p„+i(z) and Pn{z) interlace along the curve for all 
sufficiently high degrees n. 

An example illustrating the interlacing phenomenon conjectured above is shown 
in Figure 13 The large fat points in this picture are as usual the branching points. 
Normal size fat points are the zeros of ^41(2;) while thin points represent the zeros 
of Pioiz). One notices a typical interlacing pattern between these two sets of zeros. 
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Figure 3. Zeros of P4o{z) and P4i{z) satisfying the 4-terni recur- 
rence relation p„-(-i(z) = (z + 1 - /)|7„(z) + (z + l)(z - /)p„_i(z) + 
(z^ + 10)pn-2{z) with po{z) = z^ - z^ + I, pi{z) = z- I + 2 and 
P2iz) = (2 + /)z2-8. 



A simple case of this phenomenon was studied in ^21 (see Conjecture QJ. Sim- 
ilar interlacing properties for complex zeros along specified curves were noticed in 
e.g. IS]. Some caution is actually required when defining the interlacing property 
since the zeros of Pn{z) do not lie exactly on Sq. In the case when Sg is a smooth 
curve we may proceed as follows. Identify some sufficiently small neighborhood 
N{Eq) C of Sq with the normal bundle to Sq by equipping N{Eq) with a pro- 
jection onto Sq along the fibers which are small curvilinear segments transversal 
to Sq. Then one can say that two sets of points in NCBq) interlace if their pro- 
jections on Sq interlace in the usual sense. If Sg has singularities one should first 
remove some sufficiently small neighborhoods of these singularities and proceed in 
the above way on the remaining part of Sg. The above conjecture then says that 
for any sufficiently small neighborhood N{Eq) of Sq and any of its identifications 
with the normal bundle to Sq there exists uq such that the interlacing property 
for the roots of Pn{z) and Pn+i{z) holds for all n > uq. It is worth mentioning that 
the restriction degp„(z) = n is apparently unnecessary. 

5.2.3. Consider the following analog of recursion (|1.3|l with variable coefficients 

-qn(z) = 2^ — -—q„-^(z), 

^k,n\Z) 

where Pj^n{z) € C[z], < j < fc — 1, n e N. Assume that the sequence of k- 
tuples {{^Pk^n{z), Pk-i,niz), ■ ■ ■ , Po,n{z))}n&i couverges (coefficient wise) uniformly 
on compact subsets of C to {(Pfc(z), Pfc_i(z), . . . , Po{z))}. As in §1 we set r„(z) = 
g^"i^z) n £ N and we denote by ydom{z) the dominant root of the limit equation 

k 

^F^_,(z)/-* = 0. 

4=0 

Problem 2. Is it true that the sequence of rational functions {r„(z)}„gN converges 
uniformly (in the complement of some set of zero Lebesgue measure) to ydom{z)'^ 
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Several results in this direction were obtained in ^J. Note that an affirmative 
answer to Problem |21 would imply in particular that the coefficients of the 3-term 
recurrence relation H3.4(l can have no limits. 

5.2.4. Let f{z) be an algebraic function and {rn{z)}neN be a sequence of rational 
approximants to f{z) given by Theorem|21 It would be interesting to know what is 
the order of contact (in the sense of of r„(z) and f{z) at oo. More generally, 
in which cases does the sequence {?'„(z)}„gN coincide (up to a finite number of 
terms) with the sequence {[n/n](z)}„gN of diagonal Pade approximants to f{z)7 

5.2.5. A natural problem arising in the present context is the study of the strati- 
fication of the space of algebraic functions according to the topological structure of 
the induced equimodular discriminant S^. More precisely, consider the space Ak 
of all algebraic equations of degree fc. An equation P{y, z) £ Ak is called branching 
nondegenerate if all its branching points in the z-plane are simple. As noted in §4 
the endpoints of the curves in can only occur at the branching points of P{y, z) 
(of. JOlEl)- Thus the topological structure of S/ can change if and only if either 
(i) a pair of branching points collapses and P{y, z) becomes branching degenerate 
or (ii) a 4-tuple of values of f{z) have the same (maximal) absolute value. (Note 
that for a given z only the branching points for which \y{z) \ is maximal are relevant 
to S/.) What can be said about this stratification? In particular, how many top- 
dimensional strata are there and what are their (co)homology groups? A similar 
stratification related to certain quadratic differentials was studied in [H]. 

5.2.6. Well-known results on diagonal Pade approximants at oo going back to 
Markov ,25. claim that if the analytic function under consideration is the Cauchy 
transform of a positive measure supported on a real interval then the poles of all 
Fade approximants belong to this interval and thus both the Pade Conjecture and 
Nuttall's Conjecture are trivially valid for such functions (see also ^3]). On the 
other hand, we saw in §3 that already algebraic functions which are obtained as 
the Cauchy transform of a real but not necessarily positive measure supported on 
an interval might violate Nuttall's conjecture. It is therefore natural to investigate 
the Pade Conjecture and Nuttall's Conjecture for algebraic functions representable 
as Cauchy transforms of positive measures supported on compact subsets of C. An 
intriguing class of such functions was recently discovered in ^J. Let fc G N and 
consider the algebraic curve F given by the equation 

fc 

Y,Q^{zW = 0, (5.2) 

where Qi{z) — X]j=o '^i.j-^"' with degQi < i for < i < fc. The curve P is called of 
general type if the following two nondegeneracy requirements are satisfied: 

(i) degQfe(z) = fc; 

(ii) all roots of the (characteristic) equation 

ak,k + ak-i^k-it + . . . -f aofit'' = (5.3) 
have pairwise distinct arguments (in particular, is not a root of (15.31) ). 

In ^] we showed that all the branches of an algebraic curve P of general type 
vanish at oo and that each such branch is representable (up to a constant factor) 
as the Cauchy transform of a certain compactly supported positive measure. We 
believe that the answer to the following question is affirmative. 

Problem 3. Is it true that all the branches of an arbitrary algebraic curve P of 
general type satisfy both the Pade Conjecture and Nuttall's Conjecture near oo? 
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5.3. The geometry of the algebraic scheme and multidimensional contin- 
uous fractions. Let us finally discuss certain similarities between the scheme for 
approximating algebraic functions proposed in this paper and some recent devel- 
opments in multidimensional continuous fractions, see [Tl 1211 1^ I23j . Recall first 
that the conjugates of an algebraic number ^ are by definition all algebraic numbers 
other than ^ satisfying the irreducible algebraic equation with integer coefficients 
defining ^. Notice that Theorem |21 has an obvious analog for algebraic numbers. 

Definition 15. An algebraic number ^ is called dominant if its absolute value is 
strictly larger than the absolute values of all its conjugates. 

Clearly, any dominant algebraic number ^ is necessarily real. Assume addition- 
ally that the polynomial with integer coefficients P^{t) — aot'' + ait'^^^ + . . . + ak 
defining ^ is monic, that is, ao = 1. (This assumption is implicit in all the ref- 
erences quoted above where the defining polynomial is always the characteristic 
polynomial of a matrix with integer coefhcients.) Consider the k x k matrix with 
integer coefficients given by 

/ -ai -a2 -as . . . -ak \ 
1 ... 



1 ... 



V ... 1 / 

The characteristic polynomial of coincides with P^{t). Set Vq — (1,0, .. . ,0)* 
and let v„ = (ui'^^ . . . , wi"^^)* := M^Vq € Z'^ for n g N. Using the same 

methods as in §2 one can easily show that the sequence of vectors {v„}„gN thus 
defined enjoys the following property: 

Lemma 12. For any iG{l,...,A; — 1} one has lim„_+oo ""(i) = C- 

Matrices with integer coefficients whose characteristic polynomials have only 
distinct positive zeros are called hyperbolic. Such matrices are the main ingredient 
in the vast program of studying multidimensional continuous fractions suggested 
by V. Arnold (following the ideas of F. Klein). More precisely, assuming that 
is a hyperbolic matrix one gets a decomposition of M'^ into 2'^ orthants spanned by 
all possible choices of directions given by the k one-dimensional eigenspaces of M^. 
The sail of such an orthant is the convex hull of the set of all integer points in M*^ 
contained in the orthant. If all the eigenspaces of are irrational then Dirichlet's 
theorem on unities implies that the subgroup Gmj C GLk{Z) acting on M'^ and 
preserving all the eigenspaces of is isomorphic to Z2 x Z'^"^. The combinatorial 
properties of the sails defined by and the action of the group Ga/^ on these sails 
are the subject of interesting and delicate generalizations of Lagrange's theorem 
on the periodicity of continuous fractions for quadratic irrationalities. A natural 
problem that arises in this context is as follows. 

Problem 4. Let ^ be a dominant algebraic number and assume that its associated 
matrix is hyperbolic. Study the behavior of the sequence {v„}„gN with respect 
to the union of the sails defined by M^. 
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